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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

UNITED STATES PATENT APPLICATION 

Title: Film Source Video Detection 

Inventor: Peter D. Swartz 

Field of the Invention 

The present invention relates to television signal processing. More particularly, the 

present invention relates to improved signal processing apparatus and methods for detecting 
motion picture film sources in television signals. 

5 n 

Cross Reference to Related Application 
0-1 Portions of the present application are common to the copending application of Yves 

O C. Faroudja, Peter D. Swartz and Jack C. Campbell entitled "High-definition Television 
1 Signal Processing for Transmitting and Receiving a Television Signal in a Manner 
10 = Compatible with the Present System," Attorneys' Docket FAR004, filed the same day as the 

III present application. 

O Description of Related Art 

In accordance with United States Patent 4,876>9~6, it is known to detect that a 60 Hz 

15 NTSC television signal has as its source a 24 frame/second motion picture film. A 24 

frame/second film is applied to and synchronized with 60 Hz NTSC video by using a 3-2 
film pull-down ratio such that three video fields come from one film frame and the 
subsequent two video fields come from the next film frame (e.g., AABBBCCDDDEE, etc., 
where "A," "B," etc. are successive film frames). Video fields of the same parity in 

20 adjacent video frames are compared. When motion is present, a distinctive repeating pattern 
is obtained only if the video source is film (i.e., 1101111011, etc., where "1" indicates 
motion and "0" indicates no motion). 
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United States Patent 4,982,280 discloses an arrangement for detecting a 30 Hz 
progressive scan source, which may be a video camera or film, in a 60 Hz progressively 
scanned non-interlaced television system. In the presence of motion from frame-to-frame 
within the 30 Hz progressively scanned video image sequences, when the video sequences in 
5 adjacent video fields are compared a 010101, etc. pattern results, indicating a 30 Hz 

progressively scanned source. Comparison of every second video field, to yield a 111111, 
etc., pattern, further confirms the 30 Hz progressive scan source. In order to operate, the 
arrangement relies on the spatial coincidence of pixels in the successive identical pairs of 
fields of the progressive scan television signal. 

10 Published International Patent Application WO 94/30006 discloses apparatus for 

O detecting twenty-five frame per second motion picture film sources in 50 Hz television 

5 signals, such as PAL, SEC AM, 50 Hz NTSC, 50 Hz HDTV and 50 Hz monochrome 

]% television signals. The detection scheme looks for differences in motion between a field and 

O the previous field, on one hand, and between the same field and the subsequent field, on the 

15 Hj other hand. 

]^ The detection of motion picture film sources in television signals is useful in various 

. r II environments, for example, a line doubler, a television standards converter, a television slow 

motion processor and a video information compressor. 

J f A well-known "line-doubling" technique employs the conversion of an interlaced 

20 television signal (such as employed in the NTSC, PAL and SEC AM systems), in which two 
time consecutive interleaved fields of scan lines represent a frame, to a non-interlaced or 
progressively scanned television signal, in which all of the scan lines in the frame are 
scanned out sequentially. The result, in the case of NTSC, is that 525 scan lines are 
presented every one-sixtieth of a second rather than 262.5 scan lines, thus potentially 

25 doubling the vertical resolution to the full frame resolution of 525 lines. Typically, line 

doublers employ, at least part of the time, field merging or inter-field interpolation, which 

provides a full doubling of the vertical resolution. Pairs of fields are combined to form a 
single progressively scanned frame. Such arrangements are described in U.S. Patents 
4,876,956; 4,967,271; 4,982,280; 4,989,090, 5,159,451 and 5,291,280. 
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One problem resulting from field merging is that there is a time difference between 
fields in the interlaced television signal. Thus, a moving object may be in different locations 
from field to field and, if two such fields are combined to form a single frame, the vertical 
and diagonal edges of objects moving horizontally or diagonally appear as ragged sawtooths 
5 rather than smooth curves. Line doubling is lost along such transitions. The sawtooth effect 
is exacerbated if the reproduction system employs vertical enhancement. A sawtooth pattern 
is an artifact created by the incorrect interleaving of two fields arising from different film 
frames. Visually what should be a continuous vertical or diagonal line will appear as a 
zigzag on the edge. 

10 One solution to this problem is to employ some type of scan line or intra-field 

n interpolation in regions of the television picture display in which movement is occurring. An 
4 improved technique for providing intra-field interpolation in areas of the picture having 
m motion is set forth in said U.S. Patent 4,989,090. 

□i A potential problem with switchable inter-field/ intra-field interpolation line doublers is 

15 that motion may not be properly detected at all times, thus allowing the above mentioned 
jagged vertical or diagonal transition artifacts to appear briefly and intermittently. Such 
arrangements may have particular difficulty in consistently switching correctly between 
^ operating regimes when operating on an interlaced television signal that has been subject to 
O data compression or standards conversion (as from PAL to NTSC, for example). 
20 ;: Another known solution to the problem is to recognize when the source of the 

interlaced television signal is a motion picture film. As mentioned above, in the United 
States, 24 frame per second motion picture films are applied to the 30 frame per second 
NTSC television signal by means of a "3-2 pulldown ratio," such that a patterned sequence 
of 3 film frames, 2 film frames, 3 film frames, etc. are carried by consecutive television 
25 fields. By identifying pairs of interlaced fields carrying the same film frame, the pairs may 
be merged to generate essentially perfect line doubled progressively scanned frames, each 
representing a motion picture frame. Such techniques are described in said U.S. Patents 
4,876,596 , 4,982,280 and 5,291,280. 
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However, the motion picture film detection arrangement is also subject to problems. 
For example, a video edit (rather than a film edit) may disrupt the 3-2 pulldown ratio, 

causing merged television fields to be derived from different film frames. Thus, until the 

film pulldown ratio detection circuitry resynchronizes itself (typically one to four fields of the 

5 input interlaced television signal), the ragged sawtooth transition artifacts, described above, 
may appear at the edges of moving objects. Although the undesired effect is fleeting, it is 
perceptible to the eye, occasionally disturbing an otherwise near perfect line doubling 
arrangement. 

The last discussed problem may also arise in the motion picture film source line 

10 doubling arrangement when the input interlaced television signal fades from a film source to 

0 a video source. In that case, the system, as described, for example, in said U.S. Patent 

: - j 4,982,280, continues to operate in its "film mode" for a few fields before recognizing the 

•i % loss of the 3-2 pull down pattern and switching to an inter field/ intra-field interpolation 
arrangement of the sort described above. During those few fields before changeover from 
15 : 4 the film mode of operation, the same sort of ragged sawtooth artifacts may be generated if 
n there is motion between the fields. In addition, the problem may also arise when the 

1 ;:f interlaced television signal derived from a motion picture source has been imperfectly data 
i:Q compressed or standards converted. 

" : Z One solution to the problem of video edits disrupting the 3-2 pulldown ratio is set 

20 forth in said U.S. Patent 5,291,280. The solution employs a field motion detector and 

includes circuitry to "look ahead" for video edits. However, as is explained in connection 
with the various aspects of the present invention, there are shortcomings in said 5,291,280 
patent and in other prior art such that there is still a need to provide a better film motion 
detector for use in line doublers and in other applications as mentioned above. 
25 Summary of the Invention 

The present invention has a number of aspects — an improved field motion detector 
which does not treat low frequency vertical transitions as motion; a frame motion detector 
having an improved ability to differentiate motion from subcarrier signal components; a 
sawtooth artifact detector; a sawtooth artifact detector in combination with a film pattern 
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detector, such that the artifact detector can take the film pattern detector out of film mode 
earlier than it would if it only were responsive to a break in the film pattern; tandem field 
motion detectors; an improved field based film detector; film pattern detectors and motion 
detectors used therewith which operate by performing end-of-field calculations; the 
combination of a field motion detector and a frame motion detector such that the frame 

motion detector provides a motion signal used as a verification by the field motion detector; 
an improved NTSC film detector requiring a minimum number of NTSC film pattern 
sequences; an improved PAL film detector employing a minimum motion threshold detector; 
combinations of these various aspects; and a line doubler (interlaced to progressive scan 
converter) incorporating one or more of these various aspects. 

Brief Description of the Drawings 
Figure 1 is a functional block diagram showing an overall view of the present 
invention, including its various aspects. 

Figure 2 is a quasi-three-dimensional representation of three consecutive interlaced- 
scan television fields, showing relative pixel locations within the respective fields. 

Figure 3 is a functional block diagram showing an interlace to line doubled 
progressive scan converter environment in which the present invention or aspects thereof may 

be employed. 

Figure 4 is a functional block diagram showing details of the field motion detector of 

Figure 1 . 

Figures 5A-F are a series of idealized diagrams showing pixel amplitudes in video 
streams for the case of a prior art field motion detector having a high-frequency (sharp) 
vertical transition with no motion. 

Figures 6A-F are a series of idealized diagrams showing pixel amplitudes in video 
streams for the case of a prior art field motion detector having a high-frequency (sharp) 
vertical transition with motion. 
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Figures 7A-F are a series of idealized diagrams showing pixel amplitudes in video 
streams for the case of a prior art field motion detector having a low-frequency (broad) 
vertical transition with no motion. 

Figures 8A-F are a series of idealized diagrams showing pixel amplitudes in video 
streams for the case of a field motion detector according to an aspect of the present invention 
having a high-frequency (sharp) vertical transition with no motion. 

Figures 9A-F are a series of idealized diagrams showing pixel amplitudes in video 
streams for the case of a field motion detector according to an aspect of the present invention 
having a high-frequency (sharp) vertical transition with motion. 

Figures 10A-F are a series of idealized diagrams showing pixel amplitudes in video 
streams for the case of a field motion detector according to an aspect of the present invention 
having a low-frequency (broad) vertical transition with no motion. 

Figure 1 1 is a functional block diagram showing details of the frame motion detector 
of Figure 1. 

Figure 12 is a functional block diagram showing details of the NTSC film detector of 
Figure 1. 

Figure 13 is a functional block diagram showing details of the PAL film detector of 

Figure 1. 

Figure 14 is a functional block diagram showing details of the sawtooth detectors of 
Figure 1. 

Figure 15A-D are idealized representations of correctly and incorrectly interleaved 
scan lines, the incorrectly interleaved scan lines displaying the sawtooth artifact. 

Figure 16 is a functional block diagram showing details of the sawtooth based edit 
detector of Figure 1 . 

Incorporation by Reference 
Each of the following United States Patents, mentioned in the present application, are 
hereby incorporated by reference in their entirety: 4,876,596; 4,967,271; 4,982,280; 

4,989,090; 5,159,451; 5,291,280; 5,488,422; and 5,625,421. 
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Detailed Description of the Preferred Embodiments 
Except as noted herein, practical embodiments of the invention may be implemented 
using analog, digital (including digital signal processing with software), or hybrid 
analog/digital techniques. The equivalency of analog and digital implementations in most 
contexts is well known to those of ordinary skill in the art. 

Throughout this document, for simplicity, signal inputs and outputs are drawn as 
single points and signal carrying lines are drawn as single lines. It will be understood that, 
in practice, more than one input or output point and more than one signal carrying line may 
be required, depending on the format of the signals and the manner in which the practical 
embodiments of the invention are physically constructed. 

It should also be understood that functions shown separately may be combined with 
others in whole or in part. In addition, those of ordinary skill in the art will understand that, 

in practice, switches shown throughout the various figures will be implemented electronically 
or in software rather than mechanically. 

The invention or aspects of the invention may be advantageously employed in 
connection with an interlace to line doubled progressive scan converter which, for example, 
precedes a video data compressor or a standards convener. Both video compression and 
standards conversion are more accurately and easily accomplished when operating on a 
progressively scanned television signal rather than an interlaced television signal. By 
employing the present invention in a progressive scan converter preceding such video 
compression or standards conversion, such processes are assured to be operating on a more 
optimum signal source. 

An overall view of the present invention, including its various aspects is shown in 
Figure 1. Figure 3, described below, shows an interlace to line doubled progressive scan 
converter environment in which the present invention or aspects thereof may be employed. 

Referring now to Figure 1, a signal Y in is applied to an array of field and line 
memories 102. The signal Y ra may be a monochrome video signal or the luminance 
components of a component color video signal. Those of ordinary skill in the art will " 
understand that a component video signal may comprise analog or digital components such as 
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RGB (from which luminance and chrominance components may be derived), Y/I/Q, Y/U/V, 
Y/R-Y/B-Y, Y/Cr/Cb, etc. Furthermore, in the case of digital components, the received 
digital component video signal may be in any of a number of compressed or uncompressed 
formats, including, for example, various ones of the digital component video formats in 
accordance with the recommendations, standards or compression algorithms of the CCIR 
(International Radio Consultative Committee) (such as the hierarchy of digital video coding 
formats under CCIR Recommendation 601, the 4:2:2 format often being referred to as a 
CCIR 601 video signal), ISO/MPEG (the Motion Picture Experts Group of the International 
Standards Organization), SMPTE (Society of Motion Picrure and Television Engineers), EBU 
(European Broadcasting Union), and/or the recommendations or standards of other industry, 
governmental or quasi-governmental bodies. 

The array of memories 102 provides five outputs comprising time-delayed versions of 
Y in , which, along with Y^ itself, provide six time-spaced versions of the input luminance 
signal: two in a first television field F0, two in a second television field Fl and two in a 
third television field F2. 

The time-spacing of the six signal streams with respect to the spatial location of pixels 
(television picture elements) in the three consecutive fields may be better understood by 
reference to the representation of Figure 2. Figure 2 is a quasi-three-dimensional 
representation in which the vertical and horizontal dimensions lie in the plane of a television 
picture field such that the vertical axis is the vertical direction of a television picture field, 
perpendicular to the scan lines, the horizontal axis is the horizontal direction of the television 
picture field, parallel to the scan lines, and the direction perpendicular to the plane of each 
field is a quasi-time axis in which each consecutive field is shown at a discrete time. At 
some arbitrary time during the active picture information of a television field, a pixel may be 
represented as point P(FO-IH) in field F0. That pixel is shown in Figure 2 as a point in a 

horizontal scan line. A second pixel in field FO, pixel PFO, occurring one line later in time, 

is directly below pixel P(FO-IH) vertically and is shown as a point in the next lower 
horizontal scan line. A first pixel in field Fl, occurring 262 lines (in the case of NTSC, for 
PAL, 312 lines) after pixel P(F0), and a second pixel in field Fl, occurring 263 lines (in the 
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case of NTSC, for PAL 313 lines) after pixel P(F0), may referred to as pixel P(F1-%H) and 
pixel P(F1 + V2H), respectively. Pixel PfFl-^H), shown as a point in a first horizontal scan 
line, is directly above pixel P(F1 + 1 AH) vertically, which pixel is shown as a point in the 
next lower horizontal scan line, A point P(F1) in field Fl, corresponding spatially to the 
location in which pixel P(F0) of field FO lies, is half way between pixel P(F1-V£H) and pixel 
P(F1 + V2H) due to the interlacing offset of horizontal scan lines from field to field. Thus, 
pixel PCFl-ViH) precedes, by the time of one-half line, the point P(F1), while pixel 
P(F1 + V2H) is, by the time of one-half line, after the point P(F1). The spatial point in field 
F2 corresponding to FO and Fl is the location of pixel P(F2), occurring 525 lines, exactly 
one frame or two fields, after pixel P(F0). Pixel P(F2) is shown as a point in a horizontal 
scanning line in field F2. A second pixel in field F2, P(F2-1H), occurring one horizontal 
line time earlier, is shown as a point in the horizontal scan line immediately above the scan 
line containing pixel P(F0). 

Returning again to Figure 1, wherein the details of the array of field and line 
memories 102 are shown, the undelayed input luminance signal stream Y lQ itself provides the 
F0-1H output stream. The input luminance signal stream Y m is applied to a first 1H (one 
horizontal line time) delay 104 to provide the FO outpux stream. The output of delay 104 is 
also applied to a first nH delay 106 (where n is 262 lines for NTSC, 312 lines for PAL) to 
provide the FI-V2H output stream. The delay 106 output is applied to a further 1H delay 
108, to provide the FI + V2H output stream and to a funher nH delay 110 (where n is 262 
lines for NTSC, 312 lines for PAL) 10 provide the F2-1H output stream. The delay 110 
output is also applied to yet a farther 1H delay 112 to provide the F2 output stream. The 
delays may be implemented by various hardware, software and hybrid hardware/software 
techniques well known to those of ordinary skill in the art. Although the delays are shown 
as series delays, they may be implemented in other ways such as by a multiport random 
access memory in which the signal stream is read in once and read out multiple times or by 
other equivalent ways. 

Thus, the array of memories 102 provides six signal output streams corresponding to 
six pixel locations, two in each of three consecutive fields: pixels at time positions F0-1H 
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and FO in field FO, pixels at time positions FI-V2H and F1 + V4H in field Fl, and pixels at 
time positions F2-1H and F2 in field 2. 

A field motion detector 114, described below, receives the FO, FI-V2H and FI + V2H 
signal streams, signals spaced apart in time by one field minus one-half line and by one field 
plus one-half line. It is known to apply such inputs to field motion detectors — see, for 
example, U.S. Patents 4,982,280 and 5,291,280. The purpose of the field motion detector is 
to detect motion (having a velocity greater than, for example, one half line per field) from 
interlaced field to interlaced field without falsely detecting motion when a vertical transition 
occurs (for example, when the portion of the picture below a horizontal line is black and the 
portion of the picture above the line is white, or vice- versa). Although a field motion 
detector of the type disclosed in said 5,291,280 patent may be employed in aspects of the 
present invention, it is preferred to employ the improved field motion detector described 
below, which forms an aspect of the present invention, because of its greater resistance to 
false detection caused by certain types of vertical transitions. While a field motion detector 
such as that of the 4,982,280 patent may be employed in aspects of the present invention, the 
field motion detector of said 5,291,280 patent is preferable to that of the 4,982,280 patent in 
that the earlier patent provides no immunity to false detection resulting from vertical 
transitions. The output of the field motion detector 114, F0 mm , is applied to an NTSC film 
detector 118 and to a PAL film detector 120, each of which is described below. 

Optionally, a second field motion detector (not shown) below may receive the F2, Fl- 
V2E and FI + V2H signal streams, signals also spaced apart in time by one field minus one- 
half line and by one field plus one-half line. Such a second field motion detector could be 
used to provide additional redundancy or to provide increased accuracy in processing. 

The F0 and F2 outputs of memories 102 are applied to a frame motion detector 114 
where the two signals are compared on a pixel by pixel basis. The output of the frame 
motion detector, "MOTION," is applied to the NTSC film detector 118, the PAL film 
detector 120, and to a sawtooth based edit detector 126, described below. The MOTION 
signal may also be applied as an input to an interlace-to-line-doubled progressive-scan 
converter as shown in Figure 3. Frame motion detectors receiving F0 and F2 signal stream 

5 
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inputs are known in the prior art — see, for example, said U.S. Patent 4,982,280, said U.S. 
Patent 4,967,271 and said U.S. Patent 5,291,280. Various frame motion detectors, including 
those of said 4,982,280, 4,967,271 and 5,291,280 patents, may be employed as the frame 
motion detector in aspects of the present invention. However, it is preferred to employ the 
improved frame motion detector, which forms an aspect of the present invention, described 
below. 

The F0-1H, F0, F1-1H and F1 + 1H signal streams and the F2, F2-1H, F1-1H and 
F1 + 1H signal streams, respectively, are applied to an F0 sawtooth detector 128 and an F2 
sawtooth detector 130. Each sawtooth detector looks for the zigzag artifact that an incorrect 
interleaving of field 1 and field 0 or field 1 and field 2 would yield. The outputs of these 
two detectors, F0 saw and F2 saw , respectively, are applied to the sawtooth-based edit detector 
126. Sawtooth detectors are disclosed in United States Patent 5,625,421. The sawtooth 
detection techniques forming an aspect of the present invention differ from those in said 
5,625,421 patent in that the sawtooth detectors of the patent operate after interlace-to- 
progressive conversion (after line doubling) instead of before such conversion. 

As explained further below, the NTSC and PAL film detectors (118 and 120, 
respectively) examine the accumulated field motion between field 0 and field 1 and then 
□ search for the field motion film sequence pattern (a "10100" pattern in NTSC and a "10" 
q pattern in PAL) before determining that the material is film and not video. In said 4,982,280 
patent, frames, instead of fields, are compared in the film detectors. The film detectors of 
the 4,982,280 patent impose a minimum size constraint instead of accumulating pixel 
differences. Nevertheless, a state machine as in said 4,982,280 patent may be employed in 
the film detectors of the present invention. The film detectors are reset by an edit Y/N 
(yes/no) signal when a "bad edit" is detected. A bad edit means an edit performed in video 
on film material when two complete film frames have not been removed, thereby breaking 
the 3-2 pulldown ratio. A film sequence might look like 

AAABBCCCDDEEEFF, 

a good edit might look like 

AAABBEEEFF, and 
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a bad edit might look like 

AAACCCDDEEEFF, 

where A, B, etc. are video fields derived from a particular motion picture frame (frame A, 
B, etc.). 

The outputs of the film detectors are two signals, a film Y/N (yes/no) signal and an 
F0/F2 signal. The film Y/N chooses the mode under which the line doubler functions (see 
Figure 3). The F0/F2 signal selects which field signal stream, FO or F2, for the line doubler 
(see Figure 3) to interleave (merge) with the F1-V4H when the material is film. The film 
Y/N and F0/F2 signal may be applied as inputs to an interlace-to-line-doubled progressive- 
scan converter as shown in Figure 3 . 

The outputs of one of the two film detectors are selected by- a double-pole double- 
throw switch 132. The switch is controlled by an externally supplied NTSC/PAL signal. 
This selects the appropriate film detector to be active dependent upon the input video format. 
The PAL film detector may also be employed to detect 30 frame/second sources in NTSC 
signals, in which case the switch 132 is placed in the PAL switch position. 

The output of the FO and F2 sawtooth detectors (128 and 130), F0 saw and F2 saw along 
with the motion signal, film Y/N and F0/F2 signals are combined in the sawtooth based edit 
detector 126 to create the edit Y/N signal used for resetting the film detectors 118 and 120. 
An edit Y signal immediately resets the film Y/N sianal to N. 

Referring now to Figure 3 , which shows an interlace to line doubled progressive scan 
converter environment in which the present invention or aspects thereof may be employed, 

the input signal Y ia (see comments above regarding the nature of Y in ) is applied to block 302, 
which includes the various functions of the Figure 1 arrangement. Block 302 provides a first 
video stream having the F0 timing, a second video stream having the Fl-ViH timing (i.e., 
nominally 1 field time after the F0 stream timing), and a third video stream havins the F2 
timing (where FO, Fl-VzH and F2 are as described in connection with Figures 1 and 2). The 
FI-V2H video stream provides the "direct" field Y dir to one input of a line doubler pair 304. 
The F0 and F2 video streams are applied to a non-film video processor 306 and a film video 
processor 308. Block 302 applies its F0/F2 output to the film video processor 308 and its 
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film Y/N output to the control input of a single-pole double-throw switch 310. The output of 
switch 310 provides the "interpolated" field, Y mt , to the other input of the line doubler pair 

304. The outputs from the line doubler pairs 304 are combined to provide the output signal 
Y. 



out* 



In operation, scanning lines from the interpolated field Y im are interleaved with the 
scanning lines from the direct field Y dir . The interpolated field scanning lines are taken 
either from the F0 or the F2 video stream, in the case of a motion picture film source, or, 
when the source is not motion picture film, are synthesized by any of various known 
techniques such as intra-field interpolation, inter-field interpolation, a combination of intra- 
field and inter-field interpolation, or some other known technique. The example of Figure 3 
assumes that the non-film video processor 306 operates adaptively in response to the applied 
motion signal such that it employs inter-field interpolation for conditions of no motion or low 
motion (for example, by averaging F0 and F2) and intra-field interpolation for conditions of 
medium to high motion. The film video processor selects either the F0 or F2 video stream 
in response to the F0/F2 signal when a film source is detected, assuring that the correct field 
is merged with the direct field. 

Field Motion Detector 

Details of the field motion detector of Figure 1 are shown in Figure 4. The field 
motion detector compares temporally adjacent pixel information of opposing field parity 
(field 0 and field 1, for example) and differentiates field-to-field motion from vertical picture 
transitions to provide a field motion signal. 

Each field motion detector includes three subtracters (402, 404 and 406), a keep 
smaller absolute value function 408, a magnitude comparator 410, and a motion/no-motion 
switch 412. Subtracter 402 receives the F1-V4H and F0 signals. Subtracter 404 receives the 
F0 and FI + V2H signals. Subtracter 406 receives the F1-&H and F1 + V4H signals. The 

field motion detector described in U.S. Patent 5,291,280 (see, e.g., Figure 3) employs 

subtracters 402 and 404, receiving the same inputs as just described, and the keep smaller 
absolute value function 408. The field motion detectors of the present invention may use 'the 
same keep smaller absolute value function arrangement as in said 5,291,280 patent. 
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Inter-field subtractions yield field motion but also incorrectly detect vertical transitions 
as motion. The keep smaller absolute value function chooses the smaller of the two inter- 
field subtractions and thereby differentiates high frequency vertical transitions from motion. 
Unfortunately low frequency vertical transitions are still detected as motion. To overcome 
this shortcoming, the field motion detector of the present invention adds a single line vertical 
differentiator (the third subtractor 406), whose output, a measure of vertical energy, is 
compared in magnitude comparator 410 to the output of the keep smaller absolute value 
function 408, an measure of field motion. Subtractor 406, functioning as a single line 
vertical differentiator, looks for a vertical transition within a field. If the mamitude of the 
vertical transition is greater than the magnitude of the field motion, then a vertical transition 
was incorrectly determined to be motion so the output switch goes to the NO position and no 
motion is detected. However, if the field motion is greater than the vertical energy, the 
motion value YES is output. This yields a more precise motion detection. It will be noted 
that sign of motion information signals from the subtractors are irrelevant in view of the keep 
smaller absolute value function. 

The inability of the prior an field motion detector to differentiate between low- 
frequency vertical transitions and motion versus the performance of the improved field 
motion detector of the present invention may be better understood by reference to Figures 5- 
10 which depict idealized representations of pixel amplitudes (in consecutive lines) (i.e., 
vertical pixels rather than horizontal pixels) versus time in time-spaced video streams for 
various signal conditions and the resulting field motion detector signal outputs. Figures 5-7 
relate to the prior art field motion detector as disclosed in said 5,291,280 patent (which, as 
noted above, forms a portion of the improved field motion detector of the present invention), 
while Figures 8-10 relate to the improved field motion detector of the present invention. In 
referring to Figures 5-10, reference may be made to Figure 2 to appreciate more fully the 
relative position of the various pixels in the interlaced video fields. 

Referring now to Figure 5 (prior art — high-frequency vertical transition, no motion), 
Figure 5 A shows pixel amplitudes in consecutive lines in the video stream F0; Figure 5B ' 
shows pixel amplitudes in the video stream FI-V2H (262 lines in time after the F0 video 
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stream); and Figure 5C shows pixel amplitude values in the video stream Fl + ! / 2 H (263 lines 
in time after the FO video stream). Subtractor 402 produces the output shown in Figure 5D 
(one pixel), while subtractor 404 produces the output shown in Figure 5E (no pixels). The 
keep smaller absolute value function 408 thus produces the output shown in Figure 5F (no 
pixels). This output, the output that would be provided by the field motion detector of said 
5,291,280 patent, is the Correct output — the high-frequency transition in the presence of no 
motion results in the correct "no motion" output for each of the pixel positions. 

Referring now to Figure 6 (prior art — high-frequency vertical transition, motion), 
Figure 6 A shows pixel amplitudes in consecutive lines in the video stream FO; Figure 6B 
shows pixel amplitudes in the video stream FI-V2H (262 lines in time after the FO video 
stream); and Figure 6C shows pixel amplitude values in the video stream FI + V2H (263 lines 
in time after the FO video stream). Subtractor 402 produces the output shown in Figure 6D 
(two pixels) , while subtractor 404 produces the output shown in Figure 6E (three pixels) . 
The keep smaller absolute value function 408 thus produces the output shown in Figure 6F 
(two pixels) . This output, the output that would be provided by the field motion detector of 
said 5,291,280 patent, is the correct output — the high-frequency transition in the presence 
of motion results in the correct "motion" output (motion signals at pixel positions in two 
lines, indicating the amplitude of the relative field to field motion). 

Referring now to Figure 7 (prior art — low-frequency transition, no motion), Figure 
7A shows pixel amplitudes in consecutive lines in the video stream F0; Fiaure 7B shows 
pixel amplitudes in the video stream Fl-y 2 H (262 lines in time after the F0 video stream); 
and Figure 7C shows pixel amplitude values in the video stream Fl + ViH (263 lines in time 
after the F0 video stream). Each shows slowly increasing pixel values as would occur in the 
case of a low-frequency vertical transition (a fade from black to white). Subtractor 402 
produces the output shown in Figure 7D (four low level pixels), while subtractor 404 
produces the output shown in Figure 7E (four low level pixels). The keep smaller absolute 
value function 408 thus produces the output shown in Figure 7F (three low level pixels) . 
This output, shown in Figure 7H, the output that would be provided by the field motion * 
detector of said 5,291,280 patent, is an incorrect output — the low-frequency transition in the 
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presence of no motion results in an incorrect "motion" output (low amplitude motion signals 
at pixel positions in three lines). 

Referring now to Figure 8 (present invention — high-frequency vertical transition, no 
motion), Figure 8 A shows pixel amplitudes in consecutive lines in the video stream FO; 
5 Figure 8B shows pixel amplitudes in the video stream FI-V2H (262 lines in time after the FO 
video stream); and Figure 8C shows pixel amplitude values in the video stream Fl + VfeH 
(263 lines in time after the FO video stream). Subtractor 402 produces the output shown in 
Figure 8D (one pixel), while subtractor 404 produces the output shown in Figure 8E (no 
pixels). The keep smaller absolute value function 408 thus produces the output shown in 
10 Figure 8F (no pixels). Subtractor 406 produces the output shown in Figure 8G (one pixel). 
^ Since the subtractor 406 output, indicating vertical energy, is greater than the keep smaller 
absolute value function 408 output, the magnitude comparator cause the switch 412 to its no 
position, resulting in no output, indicating no motion. The output, shown in Figure 8H, is 
]Z the same as the prior art (see Figure 5), 

15 =: fl Referring now to Figure 9 (present invention — high-frequency vertical transition, 

motion), Figure 9A shows pixel amplitudes in consecutive lines in the video stream F0; 
Figure 9B shows pixel amplitudes in the video stream Fl-ViH (262 lines in time after the F0 
O video stream); and Figure 9C shows pixel amplitude values in the video stream Fl + ViH 
p (263 lines in time after the F0 video stream). Subtractor 402 produces the output shown in 

20 ^ Figure 9D (two pixels), while subtractor 404 produces the output shown in Figure 9E (three 
pixels) . The keep smaller absolute value function 408 thus produces the output shown in 
Figure 9F (two pixels). Subtractor 406 produces the output shown in Figure 9G (one pixel). 
Since the subtractor 406 output, indicating vertical energy, is less than the keep smaller 
absolute value function 408 output, the magnitude comparator moves the switch 412 to its 

25 yes position for two pixel times, resulting in the keep smaller absolute value output being 
provided as the field motion detector output, indicating motion with those amplitudes at the 
two pixel positions. For the remaining pixel positions, the subtractor 406 output is greater 
than the keep smaller absolute value function 408 output so that the magnitude comparator 
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places the switch 412 in its no position, resulting in no output, indicating no motion. The 
results, shown in Figure 9H, are the same as the prior art (see Figure 6). 

Referring now to Figure 10 (present invention — low-frequency vertical transition, no 

motion), Figure 10A shows pixel amplitudes in the video stream FO; Figure 10B shows pixel 
amplitudes in consecutive lines in the video stream FI-V2H (262 lines in time after the FO 
video stream); and Figure IOC shows pixel amplitude values in the video stream F1 + V4H 
(263 lines in time after the FO video stream). Subtracter 402 produces the output shown in 
Figure 10D (four low level pixels), while subtracter 404 produces the output shown in Figure 
10E (also four low level pixels). The keep smaller absolute value function 408 thus produces 

the output shown in Figure 10F (four low level pixels). Subtractor 406 produces the output 
shown in Figure 10G (four higher level pixels). Since the subtractor 406 output, indicating 
vertical energy, is more than the keep smaller absolute value function 408 output, the 
magnitude comparator moves the switch 412 to its no position, resulting in no output, 
indicating no motion. The output, shown in Figure 10H, is correct and different from the 
prior art (see Figure 8). Thus, the improved field motion detector of the present invention 
retains the ability of the prior art to differentiate high-frequency transitions from motion 
while adding the capability to differentiate low-frequency vertical transitions from motion. 

Frame Motion Detector 
The frame motion detector 124 (Figure 1), shown in detail in Figure 11, compares the 
FO and F2 video stream signals on a pixel by pixel basis to determine if the corresponding 
pixel in field Fl is likely to be in motion. Many prior art frame detectors may be used in 
aspects the present invention, although the improved frame motion, which is about to be 
described, is preferred. 

In the frame motion detector described in said U.S. Patent 4,982,280, high frequency 
motions are filtered out. This was done to keep any subcarrier residue from being detected 
as motion. The frame motion detector constituting an aspect of the present invention 
provides improved differentiation of motion and subcarrier signal components. 

In the improved frame motion detector forming a part of the present invention, a " 
frame subtraction is performed between FO and F2 in subtractor 1102. The subtractor 1102 
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output is then put through complementary lowpass and highpass filters created by a lowpass 
filter 1104 and subtractor 1106. In a digital embodiment, the lowpass filter may be a five- 
tap FIR filter with a zero at the color subcarrier frequency (in a frame detector intended for 
use with both NTSC and PAL, the filter characteristics may be switchable in accordance with 
whether an NTSC or PAL signal is being processed). The horizontal lowpass filtered path 
(where most of the motion lies) on line 1108 is rectified in rectifier 1110, and applied to a 
threshold function 1112 which removes noise components by applying a motion LPF noise 
threshold. In a digital implementation, the threshold 1112 may, for example, limit the signal 
to four bits (from, say, eight bits). .The horizontal highpass filtered path on line 1114 goes 
into complementary vertical lowpass and highpass filters created by subtractor 1116, adder 
1118 and delay 1120 (which provides a one-line delay for NTSC and a two-line delay in 
PAL). These paths, vertical HPF path 1122 and vertical LPF path 1124, are then 
individually rectified in rectifiers 1126 and 1128, respectively, and have their own thresholds 
(threshold functions 1130 and 1132, respectively, which remove noise components by 
applying a motion HHPF-VHPF Noise threshold and a motion HHPF-VLPF Noise threshold, 
respectively) (where HHPF is horizontal highpass filter, etc.). In a digital implementation, 
the thresholds 1130 and 1132 may also limit the respective signals to 4 bits. 

Thus, three paths are provided: a horizontally lowpass filtered (HLPF) path, a 
horizontally highpass filtered and vertically highpass filtered (HHPF-VHPF) path, and a 
horizontally highpass filtered and vertically lowpass filtered (HHPF-VHPF) path. The 
purpose of three paths is to separate color subcarrier signal components from true motion 
information. The HLPF path output has substantially no subcarrier signal components as a 
result of the horizontal lowpass filtering action of LPF 1104. The two HHPF paths, carrying 
the complement of the HLPF path, require vertical filtering to reduce the subcarrier signal 
components present in the high-frequency portion of the spectrum. Such components have 
the appearance of a vertical line pattern which may occur in real television scenes. The 
filtering action of the HHPF-VHPF path passes low amplitude level subcarrier signal 
components. By setting the HHPF-VHPF threshold sufficiently high, true motion is 
differentiated from the subcarrier components. The filtering action of the HHPF-VLPF path 
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rejects subcarrier components (which have a vertical component becaue they are out-of-phase 
from line to line) but passes horizontally moving patterns of lines (referred to as "moving 
multiburst") which must be detected as motion (such a pattern is rejected by the other two 
paths). The HHPF-VLPF path may have a lower threshold level than does the HHPF-VHPF 
5 path because the HHPF-VLPF path is not differentiating desired from undesired signal 

components based on amplitude. The HLPF and HHPF-VLPF threshold levels are selected 
for noise immunity. 

The three differently filtered motion paths are then combined in summer 1134 and 
expanded in expansion function 1136. Preferably, they are expanded horizontally by 5 

10 pixels, temporally by 1 field and vertically by 1 line. Techniques for such horizontal, 

□ vertical and temporal expansion are known. See for example, said U.S. Patent 5,488,422. 
• i The output of block 1136 is applied to one input of a keep greater value function 1138 and to 
JJ a 262H/312H (262 lines for NTSC, 312 lines for PAL) delay, the output of which is applied 
O as a second input to block 1138, and to a 1H (one line) delay 1142, the output of which is 
15 III applied as a third input to block 1138. Blocks 1136-1142 provide a temporally and vertically 
expanded motion signal. The purpose of the temporal and vertical expansion is twofold — to 
avoid the situation when a fast moving object leaves a "hole" between frames (e.g., a 
% swinging pendulum) and, in the case of an NTSC signal, to avoid an appearance of fluttering 
U between sharp and soft pictures when film material is not detected as film material. Also, 
20 expansion assures that the frame motion signal "surrounds" the field motion signal in the film 
detectors . 

The frame motion detector has three adjustable parameters: the motion LPF noise 
threshold, the motion HHPF-VLPF noise threshold, and the motion HHPF-VHPF noise 
threshold. The parameters should be adjusted to as to minimize false detection of motion 
25 caused by noise and subcarrier signal components. 

Film Detectors 

Functionally, an NTSC film detector 118 is provided separate from a PAL film 
detector 120. In practical embodiments, the two detectors are likely to share circuitry and/or 
functions. Both film detectors have the same purpose, to determine the onset of a film 
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pattern, thus entering the film mode (indicated by putting the film Y/N signal in its Y (yes) 
condition), and, after entering the film mode, when the film pattern is broken, thus exiting 
the film mode (indicated by putting the film Y/N signal in its N (no) condition). A different 
film pattern is native to each video format (NTSC or PAL) (resulting from 24 frame/second 
film sources in NTSC and 25 frame/second film sources in PAL). Both film detectors share 
certain characteristics and principles of operation. Both receive the F0 mm output of the field 
motion detector 114, the motion output of the frame motion detector 124, and the edit Y/N 
signal from the sawtooth based edit detector 126. The primary input is the field motion 
detector's FO^ signal. The frame motion signal serves as a verification of the field motion 
signal. The edit Y/N signal provides an early indication, based on the detection of signal 
conditions that produce sawtooth artifacts, that the film pattern has been broken. 

Typical film detectors in the prior art intended to detect 24 frame/second motion 
picture sources in NTSC television signals (see, e.g., said 4,982,280 and 5,291,280 patents) 
were frame based rather than field based. However, a field based film detector for detecting 
25 frame/second motion in PAL television signals is disclosed in Published International 
Patent Application WO 94/30006. The present NTSC and PAL film detectors are an 
improvement on the field based PAL film detector in said international application. Thus, 
for example, the frame based film pattern for NTSC is a five frame sequence — zero 
followed by four ones (as is well known, this pattern occurred for conditions of motion in the 
motion picture source, no motion caused the ones to become zeros, but the particular zero 
position had to remain a zero under all conditions to guarantee a film source) . In the 
improved field based detection technique of the present invention, three zero positions must 
remain zero in a five field sequence under all film motion conditions (the two one positions 
may assume either a one or zero condition), thus allowing faster detection of a disruption in 
the film pattern. 

In addition, the present film detector is responsive to an edit Y/N signal provided by 
the sawtooth based edit detection functions such that when the signal goes to its Y state, the 
film detector immediately leaves the film mode and the detector's film Y/N signal output 
immediately goes to N. The edit Y signal provides an indication within a field that the film 
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sequence pattern, thus providing an early indication before the film sequence pattern detector 

can make such a determination (the pattern detector requiring at least a field to detect an 

error). Thus, the edit Y/N signal provides a second way (the first being an interruption of 
the film pattern) to exit the film mode. 

In frame based edit detection there is one type of edit that is not detected because the 
sequence is not broken (the sequence AA ABB 122233 — see the discussion below under the 
heading "Sawtooth Detectors"). All edits are detected as bad edits in the field based 
detection (except for the one "good" edit, which is allowable — see above for example of a 
"good edit"). 

NTSC Film Detector 

The NTSC film detector 118 (Figure 1) receives the FO^ output of the field motion 
detector 114, the motion output of the frame motion detector 124, and the edit Y/N signal 
from the sawtooth based edit detector. NTSC film detector 118 looks for 24 frame/second 
{ sources in NTSC television signals. It examines the accumulated field motion between field 
0 and field 1 and then searches for the field motion film sequence pattern "1 0 1 0 0" before 
determining that the material is film and not video. Figure 12 shows the details of the NTSC 
film detector. 

The primary input to the NTSC film detector is the FO^ output of the field motion 
detector 114 (Figure 1). The F0 mm signal is filtered in low-pass filter 1202 to remove 
subcarrier residue, rectified in rectifier 1204, and blanked by film motion blanking function 
1206 to keep any picture edge artifacts and subtitles from being detected as motion. The 
rectified and blanked motion signal is then applied to a threshold 1208 to reduce noise 
artifacts. The threshold is set by a fixed and predefined film noise threshold. In a parallel 
path the motion input from the frame motion detector is thresholded by a threshold 1210 
into, in a digital implementation, a one-bit motion yes/no signal. This threshold is set by a 
motion threshold. The one-bit motion yes/no signal is then used to control a motion-no 
motion switch 1212. The frame motion indication serves as a verification of the field motion 
indication — if there is no frame motion at a given pixel then there cannot be any field 
motion for the same pixel (the frame motion has been expanded horizontally and vertically to 
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assure that the band of frame motion surrounds the field motion). The one-bit field motion 
signal from switch 1212 is then accumulated over the entire field in a field rate accumulator 
1214. 

At the end of each field the data is latched in and the motion value is then passed into 
5 three parallel paths, 1216, 1218 and 1220: 1) path 1216 to a minimum of five field detector 
1222 (although a minimum of five fields is preferred to provide a safety margin, a minimum 
of three fields is practicable), 2) path 1218 to a three field weighted averager 1224, and 3) 
path 1220 to one side of a subtractor 1226 where end-of-field calculations are performed. At 
the beginning of each field the accumulators are reset. 
10 Because the field motion is not precise, a non-moving field might have a non zero 

value associated with it. The minimum of five field detector 1222 looks at five adjacent fields 
for the minimum motion. For film in five adjacent fields there will always be a non-moving 
fg field, and therefore the minimum motion value will correspond to it. This output provides 
!;S the other input to the subtractor 1226 in order to remove the base motion value associated 
15-1 with a non-moving field from the accumulator 1214 output on path 1220. The subtractor 

1226 output is then passed to a variable threshold 1228 whose threshold is determined by the 
Ifz output of block 1224, the weighted average of three fields. 

U Inasmuch as the motion threshold is dependent on the motion of the picture 

O information, picture information with little motion will have a lower threshold than material 
20 ;i with lots of motion. The minimum motion detector 1222 and the weighted averager 1224 
greatly increase the sensitivity of film detection on small moving scenes and factor out 

computer generated material which violates the Nyquist criterion. The output of the variable 
threshold 1228, a "1" for a moving field and a "0" for a non-moving field, is then fed into a 
state machine 1230 that looks for the recurring sequence 10100. State machine 1230 may be 
25 of the type described in said 4,982,280 patent. Once the number of film sequences detected 
reaches the film sequence acquisition number, the film mode is activated. The film sequence 
acquisition number, the number of times the pattern must repeat to indicate detection of a 
film sequence, is set by the system designer. For example, the number might be three for 
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NTSC (although one might suffice) and five or six for PAL, which has a less unique film 
pattern. 

The transition to video mode occurs upon either of two conditions: 1) when the film 

sequence is disrupted by a "0" being replaced with a 'T' (note that once in film mode a "1" 
5 field can be replaced with a "0" field, as is the case of a non-moving field, but a non-moving 
field cannot be turned into a moving one), or 2) the edit Y/N signal goes high (yes). 

Because the NTSC film detector of the present invention is field based, there are three 
places in the five field sequence that a bad edit can cause the film detector to drop out of 
film mode compared to the one place under the frame motion film detection scheme — 
10 01111. As explained above, a "0" cannot become a 'T T but a "1" can become a "0" so the 

field motion sequence has three l? 0"s that can be used as an edit detector compared to the one 
u3 "0" on the frame based film sequence. This will have the effect of minimizing the negative 
g; visual effects of bad edits . 

]t The NTSC film detector has seven parameters: a film noise threshold, a motion 

15 III threshold, an NTSC film sequence acquisition number and four film blanking parameters that 
define the area of film motion detection in order to minimize edge effects: film blanking top 

line, film blanking bottom line, film blanking right side, and film blanking left side. 

ii y 

□ PAL Film Detector 

1=5 The PAL film detector 120 (Figure 1) looks for 25 frame/second sources in PAL 

20 l ^ television signals and it may also look for 30 frame/second sources in NTSC television 
signals . It examines the accumulated field motion between field 0 and field 1 and then 
searches for the field motion film sequence pattern n l 0 M before determiiiing if the material is 
film and not video. Details of the PAL film detector are shown in Figure 13. 

The PAL film detector receives the same FO field motion signal F0 mtn at its input and 
25 performs the same pixel rate processing as the NTSC film detector 118 (Figure 1) of Figure 
6. In practice, the lowpass filter 1202, the rectifier 1204, the blanking circuit 1206, the 
noise threshold 1208, the motion threshold 1210, the motion/no-motion switch 1212 and the 
field rate accumulator 1214 are common to the NTSC and PAL film detectors. 
Corresponding elements in the PAL film detector are assigned corresponding "13xx" 
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reference numerals. The common element, being described in connection with the NTSC 
film detector of Figure 6, will not be described again. 

The end-of-field calculations performed in PAL differ from NTSC because the 
sequence being detected in NTSC differs from the one being detected in PAL. The PAL 
recurring film sequence is "1 0." This sequence can occur on video material for a few 
frames in a row so care must be taken to differentiate this case. First the current field 
motion (B) from field rate accumulator 1314 is compared, in comparator 1340, in magnitude 
with the last field motion (A), derived by delaying the field rate accumulator 1314 output in 
a 1 field delay 1342 (which may be .implemented as a flip-flop clocked at the field rate by the 
vertical sync signal) to form a motion sign signal at the comparator 1340 output. 

If the current value is greater than the previous motion value a "1" is output, 
conversely a "0" is output if the current value is less than or equal to the previous motion. 
In a parallel path the two motions, the current field morion (B) and the last field motion (A), 
are modified by an operator 1344 to form the ratio | (A - B)/[(A + B)/2] | . This function, 
the absolute value of the difference divided by the average, corresponds to an adjacent field 
motion ratio, which is then passed to a threshold 1346 whose output is gated in AND-gate 
1348 with the motion sign signal. The introduction of the minimum motion constraint is 
required because the motion sign signal does not care if the difference between A and B is 1 
or 1000. For video material it is not uncommon to obtain motion values such as the 
following: 1000, 1410, 1400 and 1510. These would yield a "0 1 0 1" from the motion 
sign signal — a film-like pattern. A small motion film source might have the values as 
follows; 6, 100, 5, 110. Thus a PAL minimum motion ratio will help differentiate between 
film and video. Alternatively, but less desirably, the arrangement in the NTSC film detector 
between the field rate accumulator (1214 in Figure 12, 1314 in Figure 13) and the state 
machine (1230 in Figure 12, 1350 in Figure 13) may be used. 

If the minimum motion ratio constraint is met, then the motion sign is fed into the 
PAL film state machine 1350, which may be the same type of state machine as in said 
4,982,280 patent. If not, a 0 is input. PAL film mode is entered once the number of film 
sequences reaches the PAL film sequence acquisition number. 
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As in the case of NTSC the transition to video mode occurs upon either of two 
conditions: 1) when the film sequence is disrupted, or 2) when the edit Y/N signal goes 
high. It should be noted that a single film state machine may be used for the NTSC and 
PAL film detectors provided that a look-up table is used appropriate for the respective video 
standard. 

The PAL film detector, like the NTSC film detector, has eight adjustable parameters: 
a film noise threshold, a motion threshold, a PAL film sequence acquisition number, a PAL 
minimum motion ratio and four film blanking parameters that define the area of film motion 
detection: film blanking top line, film blanking bottom line, film blanking right side, film 
blanking left side. 

Sawtooth Detectors 

The sawtooth detectors 128 and 130 (Figure 1) look for the "sawtooth" artifact that an 
incorrect interleaving of Fl and F0 or Fl and F2 yields. Details of the sawtooth detectors 
are shown in Figure 14. 

A sawtooth is characterized by the even numbered lines having similar vertical 
content and the odd numbered lines also having similar content - the odd lines being 
horizontally offset from the even lines as depicted in Figures 15A-15D. Sawtooth artifacts 
are explained further in said 5,625,421 patent. Figure 15A shows the displayed appearance 
of a vertical bar when the lines are correctly interleaved. Figure 15B shows the same 
vertical bar when the lines are incorrectly interleaved. Figure 15C shows a diagonal bar 
when the lines are correctly interleaved. Figure 15D shows a diagonal bar when the lines 
are incorrectly interleaved. 

Each sawtooth detector includes a bank of four subtracters 1402, 1404, 1406 and 
1408. For ease of understanding, assume that even numbered lines are in field 0 and odd 
numbered lines are in field 1 (with reference to Figure 2, let the line containing pixel P F{HH 
be line 0, the line containing pixel P F0 line 2, the line containing pixel P F1 ., AH line 1 and the 
line containing pixel P F i+v4H ^ ne 3): 
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Subtracter 


Signals Subtracted 


Diagram 
Subtractions 


Sub tractor 1402 


(F0-1H) - FO 


Line 0 - Line 2 


Subtracter 1404 


FO - (F1 + V4H) 


Line 2 - Line 1 


Subtracter 1406 


(Fl-ViH) - 
(FI + V2E) 


Line 1 - Line 3 


Subtracter 1408 


(F0-1H) - (FI-V2H) 


Line 0 - Line 1 



Figure 14 and the above table shows signals present in the FO sawtooth detector. For 
the F2 sawtooth detector, substitute "Yl-ViK" for "FO-^H" and M F2" for 1 T0." 

The output of subtracters 1402-1408 proceed through four identical parallel paths 
1410-1416, each comprising four elements: a lowpass filter (1418, 1420, 1422 and 1424, 
respectively) to remove any subcarrier residue and noise; a rectifier (1426, 1428, 1430 and 
1432, respectively); an edit blanker (1434, 1436, 1438 and 1440, respectively) to keep any 
picture edge artifacts from being detected as a sawtooth, and a threshold (1442, 1444, 1446 
and 1448, respectively). The thresholds of the intra- field subtractions 1402 and 1406 are the 
same and require that the output be small thereby revealing that they align vertically . The 
thresholds of the inter-field subtractions 1404 and 1408 are the same and require that the 
output be large, indicating that there is a horizontal displacement of the transition from field 
to field. The outputs of the four threshold circuits are applied to an AND-gate 1450: the 
outputs of thresholds 1442 and 1446 are applied through an inverting input. Thus, the four 
signals are combined such that when the outputs of subtractors 1402 and 1406 are below the 
threshold and the outputs of subtractors 1404 and 1408 are above the threshold, a sawtooth is 
predicted. The output of AND-gate 1450 is F0 Saw (F2 Saw from the F2 sawtooth detector). 

The sawtooth detectors have six adjustable parameters: a sawtooth 2H similarity 
threshold, a sawtooth 1H difference threshold, and four edit blanking parameters that define 
the area of edit motion detection: edit blanking top line, edit blanking bottom line, edit 
blanking right side, and edit blanking left side. 



United States Patent Application of 
Peter D. Swartz 



Attorneys' Docket YCF153 



Although the F0 Saw and F2 Saw signals indicate when isolated sawtooth artifact signal 
conditions occur, more information is required to determine that a bad edit has occurred. 
The purpose of the sawtooth based edit detector 126 is to process the F0 Saw and F2 Saw signals 
along with other information to identify a bad edit with reasonable reliability. 

The sawtooth based edit detector 126 (Figure 1), receiving the F0 Saw and F2 Saw 
signals, the Fl-ViH signal and the frame motion signal, looks for predicted sawtooth artifacts 
on field 0 and field 2 relative to field 1 . The number of sawtooths per line are accumulated 
and compared with the number of moving horizontal transitions in the Fl -V2H signal. In the 
currently displayed field (Fl), if the ratio of sawtooths versus moving edges meets the 
minimum ratio and this ratio is met over a predefined number of adjacent lines, then an edit 
is detected. On the non-displayed or alternate field (FO or F2), if there are more sawtooths 
than moving edges and there are sawtooths detected on the currently displayed field and this 

occurs over a predefined number of adjacent lines, an edit is detected. The details of the 

sawtooth based edit detector are shown in Figure 16. 

The sawtooth based edit detector has two distinct input portions: a moving edge 
detecting input portion and a sawtooth detecting input portion. The moving edge detecting 
input portion receives the Fl-V^H signal input and applies it to a horizontal differentiator 
1602 which looks for horizontal transitions. This may be implemented by a one-clock (one 
pixel) delay 1604 and a subtracter 1606. The differentiator 1602 output is then filtered by a 
lowpass filter 1608, rectified by rectifier 1610, blanked by edit blanker 1612 and applied to a 
threshold 1614 which provides noise thresholding under control of an H edge (horizontal 
edge) threshold. In a parallel path the frame motion signal is also thresholded by threshold 
1616 under control of a motion threshold to form a motion yes/no signal which is then 
ANDed in AND-gate 1618 with the horizontal transition signal, allowing only moving 
horizontal edges to pass. Moving horizontal edges are applied to an H rate (horizontal rate) 
moving horizontal edge accumulator 1620 in which the number of distinct moving edges, 
determined by detecting their peaks, is accumulated over the course of one line. 

The sawtooth detection input portion receives the F0 Saw and F2 Saw outputs from the F0 
sawtooth detector and the F2 sawtooth detectors, respectively. The F0 Saw and F2 Saw signals 
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are applied to respective H min (horizontal minimum) detectors 1622 and 1624. For a 
sawtooth to be detected it must have a minimum horizontal duration of two pixels. The 
number of sawtooth instances having more than the minimum two pixel duration during the 
time period of each horizontal line are accumulated in respective H rate sawtooth instant 
accumulators 1626 and 1628. The number of instances for the FO and F2 signals are applied 
to two parallel paths, 1630 and 1632, representing the currently displayed field and the 
alternate field, respectively. The determination of which is FO and which is F2 depends on 
the state of the F0/F2 switch in the system (as determined by the NTSC or PAL film 
detector). The F0/F2 signal from the NTSC or PAL film detectors controls a double-pole, 

double-throw switch 1629 which reverses the F0 Saw and F2 Saw signal inputs depending on the 

polarity of the F0/F2 signal. 

In the currently displayed path 1630, the number of sawtooths on the currentiy 
displayed field is divided by the number of moving edges in a divider 1634. If the quotient 
is greater than the sawtooth moving edge ratio as determined by a threshold 1636, then a 
counter 1638 is incremented. If the next line does not meet the same criteria, the counter 
1638 is reset. Once the counter reaches the current field sawtooth V min threshold set by a 
threshold 1640, an edit is detected. 

There are certain cases when the currently displayed edit detector path 1630 fails. 
This is when the motion detection occurs between fields of different scenes, causing most of 
the field to appear to be in motion. This greatly raises the number of horizontal transitions 
that are seen to be moving, which has the effect of lowering the horizontal transitions to 
sawtooths ratio and keeping an edit from being detected. This is a problem principally when 
the two fields that are interleaved come from the same information — generally the post edit 
information (see example below). The alternate field edit detector in path 1632 addresses 
this problem. 

The alternate field edit detector portion of the sawtooth based edit detector takes as its 
input the number of detected sawtooths from the non-currently-displayed field. The number 
should generally be large if the field is in motion. The signal passes through a switch 1642 
controlled by the occurrence of any sawtooths in the currently displayed field. If there are 
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none, then the switch outputs 0, otherwise it outputs the number of sawtooths detected. This 
number is then compared with the number of moving edges in comparator 1644. If it is 
equal to or greater than the number of moving edges, the counter 1646 is incremented. If 
the next line does not meet the same criteria then the counter is reset. Once the counter 
1646 reaches the Alternate field sawtooth V Min threshold set by a threshold 1648, an edit is 
detected. The outputs of the two processing branches are applied to an OR gate 950 so that 
whichever branch detects an edit, that output is applied to a final switch 1652 which ensures 
that the edit detector only detects edits on film material. 

In order to better understand the operation of the sawtooth based edit detector, 
consider the following example. A film stream with an edit might look like AAABB122233, 
where the letters represent pre-edit information and the numbers represent post-edit material. 
Assume that the film detector interleaves 1 and 2 and there is motion between these fields. 
Therefore, sawtooths are visible (assume five sawtooths per line). This will also imply that 
there are five moving edges per line. The frame motion path subtracts B - 2, so all of the 
field is seen as moving. Now every edge in field 1 is seen as moving. If there are twenty 
edges, all twenty are seen as moving. The ratio of sawtooths versus moving edges in no 
longer 5/5, but 5/20 which lies below the threshold, so no edit is detected. The alternate 
field path now is detecting sawtooths between B and 1. This will be a number generally 
greater than or equal to the actual number of transitions in field 1. (Note: this is the case 
for moving film as well). For our example, assume thirty sawtooths were detected per line. 
Thirty sawtooths is greater than twenty moving edges and there are five sawtooths on the 
currently displayed field, so an edit is detected. For standard film there would be no 
sawtooths on the currently displayed field, so no edit would be detected. Because computer 
generated single line material can meet these requirements, a vertical minimum threshold has 
been imposed. 

The sawtooth based edit detector has nine adjustable parameters: an H edge 
threshold, a motion threshold, a sawtooth moving edge ratio, a current field sawtooth V min 
threshold, an alternate field sawtooth V min threshold and four edit blanking parameters that 
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define the area of edit detection: edit blanking top line, edit blanking bottom line, edit 
blanking right side, edit blanking left side. 
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